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ABSTRACT 

Zinc-finger recombinases (ZFRs) are chimaeric pro- 
teins comprising a serine recombinase catalytic do- 
main linked to a zinc-finger DNA binding domain. 
ZFRs can be tailored to promote site-specific 
recombination at diverse 'Z-sites', which each com- 
prise a central core sequence flanked by zinc-finger 
domain-binding motifs. Here, we show that purified 
ZFRs catalyse efficient high-specificity reciprocal 
recombination between pairs of Z-sites in vitro. No 
off-site activity was detected. Under different reac- 
tion conditions, ZFRs can catalyse Z-site-specific 
double-strand DNA cleavage. ZFR recombination 
activity in Escherichia coli and in vitro is highly de- 
pendent on the length of the Z-site core sequence. 
We show that this length effect is manifested at 
reaction steps prior to formation of recombinants 
(binding, synapsis and DNA cleavage). The design 
of the ZFR protein itself is also a crucial variable 
affecting activity. A ZFR with a very short (2 amino 
acids) peptide linkage between the catalytic and 
zinc-finger domains has high activity in vitro, 
whereas a ZFR with a very long linker was less 
recombination-proficient and less sensitive to vari- 
ations in Z-site length. We discuss the causes of 
these phenomena, and their implications for prac- 
tical applications of ZFRs. 

INTRODUCTION 

Site-specific recombinases are enzymes that promote pro- 
grammed DNA rearrangements by breaking and rejoining 
DNA strands at specific sequence loci. Many site-specific 
recombinases belong to one of two large groups of 



enzymes called the serine recombinases and the tyrosine 
recombinases. These two groups are structurally unrelated 
and promote recombination by different mechanisms (1). 
Site-specific recombinases have become essential tools for 
the experimental manipulation of transgenic DNA se- 
quences within the cells of living organisms. The bacterio- 
phage tyrosine recombinase Cre, the yeast tyrosine 
recombinase FLP and the bacteriophage cbC31 serine 
integrase have been the enzymes most commonly used 
for these purposes to date, but many other microbial re- 
combinases have been shown to have potential utility (2). 
The recombination sites for these enzymes (typically 
~30^40bp) are normally incorporated into the design of 
a transgenic construct which is introduced into the 
organism being studied. This need for exogenous recom- 
bination sites restricts the current applications of 
site-specific recombination to genetically modified organ- 
isms. There has therefore been much interest in the devel- 
opment of methods to reprogramme recombinases to act 
on natural genomic sequences (3). 

'Directed evolution' methods have successfully produced 
re-targeted recombinase variants (4-8). However, this 
strategy might not be applicable to targets that differ 
very substantially from the natural site for the progenitor 
recombinase (9). An alternative approach, pioneered in 
this lab, is to direct the recombinase catalytic domain to 
the chosen target site by linking it to a DNA binding 
domain which is engineered to recognize an adjacent 
sequence motif. Initially, we used a catalytic domain 
derived from the serine recombinase Tn3 resolvase, 
linked to a zinc-finger DNA binding domain from the 
mouse transcription factor Zif268 (10) (Figure 1). These 
zinc-finger recombinases (ZFRs; previously referred to as 
Z-resolvases) were shown to promote efficient site-specific 
recombination in Escherichia coli (10). The 'Z-sites' 
recombined by the ZFRs each consisted of a central 'core' 
sequence from binding site I of the TnJ recombination site 
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res, flanked by 9 bp motifs recognized by the zinc-finger 
domains (Figure IB). ZFRs can be modified to target a 
wide range of different Z-sites, by substituting the 
zinc-finger domain with modified altered-specificity 
versions, by mutagenizing the resolvase catalytic domain 
to alter or reduce its specificity, and by using catalytic 
domains from other serine recombinases (3,11-13). Tn3 
resolvase-derived ZFRs are predicted to bind to Z-sites 
as dimers (Figure IB), but might be monomeric in 
solution (14), and thus could recognize asymmetric 
Z-sites as on-site heterodimers (12,13). Their mode of 
target recognition can thus be compared to that of 
zinc-finger nucleases (15,16). 

Except for a preliminary in vitro experiment (10), all 
analysis of ZFR activity to date has been in vivo (in 
E. coli and in human cell lines) (10,12,13,17-19). 
However, a more detailed understanding of the design, 
function and mechanism of these enzyme systems is essen- 
tial if they are to be established as practical tools for ma- 
nipulation of genomic DNA sequences. Many unanswered 
questions remain. For example, the design of the linker 
between the ZFR recombinase and zinc-finger domains 
(i.e. its length and sequence) may still be far from 
optimal; the effects of the Z-site length and sequence on 
activity and specificity are still not well understood; the 
possible correlation between Z-site core sequence length 
and the most effective ZFR linker length has not been 
investigated in any detail; and the steps in the reaction 
pathway that are affected by ZFR and Z-site design are 
not known. Furthermore, the known properties of 
activated serine recombinases suggest that ZFRs may 
catalyse undesirable side-reactions (such as accumulation 
of DNA strand breaks) to some extent (20); optimized 



design of the system components might minimize these 
problems and thus improve ZFR utility in vivo. These 
(and other) questions will be best addressed by studying 
the properties of purified ZFRs in vitro. Here, we report 
the establishment of an in vitro system for ZFR-mediated 
recombination, and use it to show how specific reaction 
steps are affected by ZFR design and Z-site core sequence 
length. 



MATERIALS AND METHODS 

Plasmids and in vivo recombination 

Plasmids encoding ZFRs, for assays in E. coli or for 
protein overexpression, were constructed as described pre- 
viously (10). Recombination substrate plasmids for assays 
in E. coli and in vitro were constructed as described (10). 
The Z-site names denote the length of the core sequence 
between the two 9 bp Zif268-binding motifs (Figure 2B); 
for example, Z22 has a 22 bp core sequence. In this work, 
the core sequence is exactly as in Tni res site I (21), with 
the centre of site 1 at the centre of the Z-site, unless stated 
otherwise. Plasmid DNA for in vitro assays was purified 
with a Qiagen midiprep kit. Full sequences of all plasmids 
are available on request. 

Assays for ZFR-mediated recombination in E. coli were 
essentially as described (13). To determine the extent of 
recombination, plasmid DNA was isolated. Following 
growth of transformants on MacConkey agar indicator 
plates, the cells were recovered from each plate in 1 ml 
of L-broth. One microlitre of the suspension was used to 
inoculate 10 ml of L-broth, and this culture was incubated 
overnight at 37°C with ampicillin and kanamycin 




binding synapsis cleavage recombination 



Figure 1. ZFR structure and activity. (A) Crystal structure (1GDT) of a yS resolvase dimer bound to DNA (res site I). The catalytic domains are 
blue, and the DNA binding domains are green. (B) Cartoon of a ZFR bound to a Z-site. The catalytic domains are blue, the zinc-finger domains are 
yellow, and the linker peptides between the two domains of each subunit are magenta. The Z-site core sequence is grey, and the motifs bound by the 
zinc-finger domains are orange. (C) Hypothetical pathway for ZFR-mediated site-specific recombination. Following binding to the Z-sites and 
synapsis of two sites, the four ZFR subunits each catalyse cleavage of one DNA strand, forming two double-strand breaks with a ZFR covalently 
linked to each end (the covalent links are not shown). A 180° rotation of one pair of DNA ends and its attached ZFRs relative to the other pair, 
followed by re-ligation of the broken ends, completes recombination. 



9318 Nucleic Acids Research, 2011, Vol. 39, No. 21 



R148 



R144 




B 

Tn3R 
ZFR(L1) 
ZFR(L2) 
ZFR(L3) 
ZFR(L4) 
ZFR(L5) 
ZFR(L6) 
ZFR(L7) 
ZFR(L8) 
ZFRB1 
ZFRB2 

c 

Tn3 site I - 

Z16 
Z18 
Z20 
Z22 
Z24 
Z26 
Z28 



■3- 



co 



FGRRRTVDRNVVLTL ■ ■ ■ 
FGRRRTVD0-SSDPTSQTS 
FGRRR- -SDYTQNN I HPTS 
FGRRRNRVAQQLAGKQSTS 
FGRRR GSGGSGTS 

FGRRR GSGGSGGSGTS 

FGRRRGSGGSGGSGGSGTS 

FGRRR GGGSGGGTS 

FGRRRGGGSGGGGSGGGTS 

FGRRRTVDg TS 

FGRRR TS 



ERPY ' 
ERPY ' 
ERPY' 
ERPY' 
ERPY' 
ERPY' 
ERPY' 
ERPY' 
ERPY' 
ERPY' 



-CGTTCGAAATATTATAAATTATCAGACA- 



GCGTGGGCGAAATATTATAAAT 

GCGTGGGCGGAAATATTATAAAT 

GCGTGGGCGCGAAATATTATAAAT 

GCGTGGGCGTCGAAATATTATAAAT 

- -GCGTGGGCGTTCGAAATATTATAAAT 
-GCGTGGGCGGTTCGAAATATTATAAAT 
GCGTGGGCGCGTTCGAAATATTATAAAT 



TATCGCCCACGC 

TATCCGCCCACGC 

TATCACGCCCACGC 

TATCAGCGCCCACGC 

TATCAGACGCCCACGC- - 
TATCAGACCGCCCACGC - 
TATCAGACACGCCCACGC 



Figure 2. Variable features of ZFR-mediated recombination. (A) Structure of the 'arm' region of resolvase (residues 120-148 are shown) bound to 
DNA (data are from the crystal structure 1GDT). The protein backbone is shown as cartoon, and the DNA in semi-transparent surface represen- 
tation. Sidechains of F140, R142, R144 and R148 are shown as sticks, and the a-carbon of G141 is shown as a sphere. (B) Alignment showing the 
linker sequences of ZFRs tested (and all found to be recombination-proficient) by Akopian et al. (10), and the new ZFRs reported here (names in 
bold). Resolvase sequence is in blue, linker sequence is in magenta, and Zif268 sequence is in orange. Variants of ZFRB1 are described in the 
'Results' section. Resolvase residues R144 and R148 are boxed. (C) The recombination sites (Z-sites) used in this study, aligned with site I of Tni res 
(only the top strands are shown). The motifs bound by the Zif268 domains are in orange. The motifs in site I bound by the resolvase C-terminal 
domain are in green. The central AT dinucleotide of each site is in bold. 



selection. Plasmid DNA was prepared using a Qiagen 
miniprep kit, and analysed by agarose gel electrophoresis. 

The method for selection of ZFRs with an enhanced 
rate of recombination in E. coli was adapted from Burke 
et al. (22). The coding sequence for ZFRB1 (see 'Results' 
section) was subjected to PCR mutagenesis and then 
cloned to create a library of mutant expression plasmids. 
The selection involved a plasmid pStr(Z22), which encodes 
a strA gene conferring sensitivity to streptomycin flanked 
by Z22 sites. The Z22 site sequences are as described in 
Akopian et al. (10). Deletion of strA leads to streptomycin 
resistance. E. coli cells containing pStr(Z22) were trans- 
formed with the library DNA, and expression plasmids 
were isolated from streptomycin-resistant colonies that 
appeared at early time points. 



ZFR purification 

The purification procedure was adapted from that used 
for TnJ resolvase mutants, and was essentially as 
described (20), except that Zn 2+ was added to the buffer 
for refolding the protein after denaturing ion exchange 
chromatography. A full protocol is in Supplementary 
Data. 

In vitro recombination and cleavage assays 

In a typical recombination reaction, 6 ul of ZFR, diluted 
in a buffer containing 20 mM Tris-HCl (pH 7.5), ImM 
DTT, 1 M NaCl and 50% glycerol to a concentration of 
~4uM, was added to 60 ul of a solution of supercoiled 
substrate plasmid DNA (25 ug/ml) in a buffer containing 
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50 mM Tris-HCl (pH 8.2) and 10 mM MgCl 2 . Note the 
absence of chelating agents such as EDTA. The mixture 
was kept at 37°C for 1 h, then the reaction was stopped by 
heating at 70°C for lOmin. The sample was divided into 
three 22 ul aliquots which were either untreated, or 
digested with restriction enzymes, or nicked with DNase 
I, before separation by agarose gel electrophoresis. 
Procedures for DNase I nicking and gel electrophoresis 
were as described (20). The gels were 1.2% w/v agarose 
except for nicked samples, which were run on 0.7% gels. 
Excess protein and any recombinase subunits covalently 
linked to the DNA were destroyed by addition of loading 
buffer to all samples (20% of the final sample volume), 
which contained 100 mM Tris-HCl (pH 7.5), 50% v/v 
glycerol, 0.5% w/v SDS, 1 mg/ml protease K and 
0.5mg/ml bromophenol blue. Assays for ZFR-mediated 
cleavage followed a similar procedure except that the 
solution prior to ZFR addition contained plasmid DNA 
(25ug/ml), 50 mM Tris-HCl (pH 8.2) and 40% v/v 
ethylene glycol. Reactions were stopped as above, then 
SDS + protease K loading buffer was added and the 
products were separated on 1.2% agarose gels. 

Binding/synapsis assays 

The procedure was modified from Olorunniji et al. (20). 
The gel-purified 50 nt oligonucleotide which was to be the 
Z-site top strand was 3 P-phosphorylated at the 5'-end 
using T4 kinase, and was then annealed with an equimolar 
amount of unlabelled bottom strand. Diluted ZFR (2.2 ul) 
was added to 20 ul samples containing lOOnM Z-site 
DNA, 20 mM Tris-HCl (pH 7.5), 10 uM zinc acetate, 
20 ug/ml poly(dI/dC) and 4% w/v Ficoll. The final ZFR 
concentration was ~800nM (or as indicated otherwise). 
The samples were cooled in ice for lOmin, then loaded 
onto an 8% polyacrylamide gel (30:0.8 acrylamide: 
bisacrylamide). The gel contained lx TB buffer (89 mM 
Tris base, 89 mM boric acid), plus 10% v/v glycerol. The 
running buffer was 1 x TB. Gels were pre-run for 30min at 
200 V (~11 V/cm) and 4°C. Samples were then loaded and 
the gels were run for a further 4.5 h at 200 V under the 
same conditions, without buffer recirculation. Gels were 
dried, and bands were visualized by phosphor-imaging. 

Molecular modelling 

Models of ZFRs bound to Z-sites with different core 
sequence lengths were built in PyMol (23) using crystal 
structures of a yS resolvase dimer-DNA complex 
[1GDT; (24)], a synaptic jS resolvase-DNA intermediate 
[1ZR4; (25)] and a Zif268 zinc-finger domain-DNA 
complex [1AAY; (26)]. The required core sequence 
length was achieved by superimposition of appropriate 
deoxyribose C-l' atoms in the region of DNA overlap 
between the resolvase and Zif268 structures. 

RESULTS 

Redesigned ZFRs 

The first ZFRs (10) utilized the catalytic domain of 'NM 
resolvase', an activated Tn3 resolvase variant with six 



mutations [R2A E56K G101S D102Y M103I Q105L; 
(22)]. This domain, from the N-terminus to residue R144 
or R148, was followed by a linker of 8-14 amino acids, 
then the Zif268 DNA binding domain [starting at 
residue E2, in the domain as used for crystallography 
(26)] (Figure 2B). The C-terminal cut-off point of the 
resolvase domain was chosen so as to include all the resi- 
dues implicated in catalysis and subunit interactions, and 
most (-R144) or all (-R148) of the 'arm region' of 
resolvase [residues 121-147; (24)]. The arm region inter- 
acts with the minor groove near the centre of the recom- 
bination site (Figure 2A), and might therefore be 
important for efficient catalysis. The linkers between the 
recombinase and zinc-finger domains (Figure 2B) were 
made quite long and flexible, because it had been 
thought that conformational plasticity in this part of the 
protein might be desirable. However, it turned out that the 
length and sequence of these linkers had surprisingly small 
effects on ZFR-mediated recombination in E. coli (10), so 
we speculated that a shorter linker might be tolerated or 
even advantageous. 

We tested two new ZFRs with a short linker (Figure 2B). 
ZFRB1 consists of NM resolvase sequence from the 
N-terminus to residue R148, followed by the 2-amino 
acid linker TS, then the Zif268 DNA binding domain 
starting at residue E2. ZFRB2 is identical to ZFRB1 
except that the NM resolvase sequence is four amino 
acids shorter, ending at residue R144. The linker 
sequence TS [also used at the C-terminal ends of the 
linkers of Akopian et al. (10)] was chosen because it 
should be quite flexible, consisting of two relatively 
small hydrophilic residues, and allows a useful Spel re- 
striction site to be introduced into the DNA sequence. 

The new ZFRs were tested first in E. coli. The substrate 
plasmids each have two identical Z-sites flanking a galK 
marker gene. The Z-site core sequence length was varied 
from 16 to 28 bp in 2 bp increments, by putting sequence 
from the centre of TnJ res site 1 between the Zif268 
binding motifs (Figure 2C). Thus, the longest site tested, 
Z28, has a complete copy of site I (28 bp) at its centre. 
Strains containing substrate plasmids were transformed 
with a ZFR expression plasmid, and transformants were 
selected on MacConkey agar indicator plates. Deletion of 
the galK gene by ZFR-mediated recombination (Figure 3A) 
results in pale-coloured colonies, whereas galK^ colonies 
are red. The colony colour gives a visual indication of 
recombination activity, but to estimate activity more ac- 
curately we recovered the plasmid DNA (see 'Materials 
and Methods' section). ZFRB2 was completely inactive 
on all substrates, but ZFRB1 was recombination- 
proficient; pGal(Z20) and pGal(Z22) were completely 
resolved, pGal(Z24) was partially resolved, and the other 
plasmids were not resolved at all (Figure 3B). 

For the in vitro studies described below, we purified the 
following proteins: 

(1) ZFRB1 (see above). 

(2) ZFRB1SA, which is identical to ZFRB1 except for a 
mutation of the active site serine residue (S10) to 
alanine which renders the protein catalytically 
inactive. 
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Figure 3. ZFR-mediated recombination at Z-sites in E. coli. (A) The assay used to observe ZFR-mediated recombination in E. coli. Deletion 
of a galK marker gene causes a reduction in plasmid size and a change in colony colour on MacConkey indicator plates from red to pale. 
(B) Gels showing plasmid DNA recovered from E. coli cells after ZFR expression. Each substrate plasmid contained two identical Z-sites as 
cartooned in Figure 3A. The Z-site sequences are shown in Figure 2C. The ZFRs are described in the 'Results' section and Figure 2B. The 
annotation of the bands is as follows: exp, ZFR expression plasmid; nr, non-recombinant (substrate) plasmid; rec, recombinant plasmid (galK 
segment deleted). 



(3) ZFRB1WT, which is identical to ZFRB1 except 
that the recombinase domain is that of wild-type 
TnJ resolvase, not the activated resolvase mutant 
NM. This protein has no recombination activity on 
Z-sites in E. coli (data not shown). 

(4) ZFRB1VF, identical to ZFRB1 except for a 
mutation V107F in the resolvase domain. This 
mutant was selected for fast recombination activity 
in an E. coli assay (see 'Materials and Methods' 
section). When tested on the pGalK substrates, 
ZFRB1VF activities were similar to those of 
ZFRB1 (Figure 3B). 

(5) ZFR(L6). This protein has the highest activity in 
E. coli of the variants created by Akopian et al. 
(10). It has a long (14 amino acids) linker between 
the resolvase and Zif268 domains (Figure 2B). In E. 
coli, ZFR(L6) had recombination activity on pGalK 
substrates with a wider range of core sequence 
lengths than ZFRB1 (Figure 3B). 

(6) We also carried out comparative experiments with 
NM resolvase, purified as described previously (20). 



In vitro recombination by ZFRs 

First, we tested the activity of each ZFR on a supercoiled 
plasmid pRec(Z22), which contains two copies of Z22 (the 
Z-site that was recombined most efficiently in E. coli) 
(Figure 4A). Preliminary titration experiments were 
carried out with each ZFR to assess the dependence 
of activity on protein concentration (Supplementary 
Figure SI). pRec(Z22) was then incubated with an 
optimal concentration of each ZFR (~1 uM) at 37° C for 
1 h. Each sample was divided into three aliquots; one was 



not treated further, one was digested with Xhol, and one 
was treated with DNase I to nick covalently closed DNA. 
The aliquots were then loaded onto agarose gels and sub- 
jected to electrophoresis (Figure 4B). For comparison, a 
similar substrate pRec(site I) containing two copies of TnJ 
res site I was treated with NM resolvase under the same 
conditions. 

ZFRB1 recombined pRec(Z22) efficiently, giving a very 
similar product distribution to the NM resolvase reaction 
on pRec(site I) (Figure 4B). The expected resolution and 
inversion products were observed, along with smaller 
amounts of intermolecular recombination products. The 
resolution products were predominantly free circles, 
though a small amount of topologically complex species 
could be seen on nicking with DNase I. The predominance 
of topologically simple products may be a consequence of 
multiple rounds of recombination (20). In the 'uncut' 
samples, each recombinant circle product appeared as 
several bands. These bands are covalently closed topoiso- 
mers along with some nicked circles, and therefore both 
strands of the recombinant Z-site DNA have been 
re-ligated in a large fraction of the products. The topoiso- 
mers were converted to nicked circles on treatment with 
DNase I. There was also a noticeable amount of linearized 
substrate and linearized recombinant-length DNA in the 
uncut samples (Figure 4B), which might derive from in- 
completely ligated reaction intermediates. 

Surprisingly, ZFRB1VF, which had been isolated as a 
rate-enhanced mutant in vivo, gave significantly lower 
amounts of recombinant products than its parent 
ZFRB1. There were substantial amounts of nicked sub- 
strate DNA in the uncut sample, suggesting ZFR- 
mediated abortive cleavage of only one DNA strand. 
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Figure 4. ZFR-mediated recombination in vitro. (A) In vitro recombination of a plasmid substrate. In parts B and C, the plasmid used (pRec(Z22)) 
contained two 722 sites. In this diagram, the resolution and inversion recombinant products are shown, along with one example of a product of 
intermolecular recombination. Sites for restriction enzymes used in the analysis are marked. (B) Reaction products after treatment of pRec(Z22) with 
ZFRs were visualized by agarose gel electrophoresis either without further treatment (left panel), or after digestion with Xhol (centre panel), or after 
nicking with DNase I to remove supercoiling (right panel). The ZFR used is indicated above each lane. The right-hand lane of each panel shows the 
products from pRec(site I), which has two copies of Tni res site I instead of Z-sites, after treatment with the activated Tn5 resolvase mutant NM. 
The bands are annotated as follows: sc, supercoiled circle; nc, nicked circle; lin, linearized plasmid; rc, recombinant circle; topo, this zone contains 
several bands including topoisomers of substrates and products, and linearized recombinant DNA; inter, this zone contains the large products of 
intermolecular recombination; nr, non-recombinant; res, recombinant (resolution); inv, recombinant (inversion); kn/cat, this zone contains bands 
assigned as nicked knotted and catenated reaction products. Other, faint bands in the restriction digests derived from cleavage products are not 
labelled. (C) Reactions of pRec(Z22) with ZFRs in 40% ethylene glycol buffer, which inhibits re-ligation of cleaved intermediates. The annotation is 
as in part B, with the addition of: cleav, product of double-strand cleavage at both recombination sites. Cleavage at only one site gives a full-length 
linear product (labelled lin). All samples were treated with protease K (see 'Materials and Methods' section), so the ZFR subunits covalently linked 
to the cleaved DNA ends have been destroyed. (D) ZFRBI -mediated recombination of linearized pRec(Z22). The annotation of bands is as in (B and 
C). The expected products of intramolecular recombination are illustrated in the scheme to the right of the gel image. The bands are labelled as in 
(B), with the additional label rlin, linear 'resolution' product. 
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ZFR(L6) was also much less proficient for recombination 
than ZFRB1, and gave greater amounts of nicked sub- 
strate. Both ZFRB1VF and ZFR(L6) gave some linear 
products, as did ZFRB1 (see above). ZFRB1WT, whose 
catalytic domain is that of wild-type TnJ resolvase, was 
completely inactive in this assay, showing that activating 
mutations in the catalytic domain are essential for ZFR 
recombination proficiency. 

In a buffer containing a high concentration of ethylene 
glycol and no Mg 2+ , re-ligation of strands by resolvase 
is suppressed, and resolvase-linked products of cleavage 
at the recombination sites accumulate (27,20). We tested 
the ZFRs under these conditions (Figure 4C). ZFRB1, 
ZFRB1VF and ZFR(L6) all cleaved pRec(Z22) very effi- 
ciently; nearly all the substrate DNA was cleaved at either 
one or both Z-sites. ZFRB1VF had slightly higher cleav- 
age activity than ZFRB1 (i.e. more products cleaved 
at both Z-sites), ZFR(L6) had lower activity and 
ZFRB1WT was completely inactive (as expected; see 
above). There was no evidence for any off-site ZFR 
activity, which could have been revealed by the presence of 
extra linear DNA bands in the cleavage assays. Likewise, 
the ZFRs had no recombination or cleavage activity on 
pRec(site I) (data not shown), even though the two copies 
of res site I in this plasmid are the natural target for the 
resolvase domains of the ZFRs. 

The foregoing assays used a supercoiled plasmid sub- 
strate. In many potential applications of ZFRs (for 
example, in eukaryotic cells), the substrate DNA would 
not be supercoiled, so it was important to demonstrate 
that non-supercoiled DNA is also a substrate in vitro. 
Linear substrate DNA was prepared by restriction 
enzyme digestion of pRec(Z22). The purified linear 
DNA was then treated with ZFRB1. After 1 h, much of 
the substrate DNA had reacted to give products of reso- 
lution, inversion and intermolecular recombination 
(Figure 4D). The DNA segment excised in the resolution 
reaction was present as a relaxed circle. 

Effects of Z-site core sequence length 

Our experiments in E. coli (Figure 3; see also Ref. 10) 
revealed that ZFR activity depended on the Z-site core 
sequence length. We therefore tested the effect of core 
sequence length in vitro. The in vitro results were entirely 
consistent with the in vivo observations. Only pRec(Z20), 
pRec(Z22) and pRec(Z24) were recombined by ZFRB1, 
and of these pRec(Z22) was recombined most efficiently 
(Figure 5A). Similarly, only pRec(Z20), pRec(Z22) and 
pRec(Z24) were cleaved efficiently by ZFRB1 in ethylene 
glycol buffer, though traces of linearized full-length 
plasmid DNA were observed with the other substrates. 
ZFR(L6), which has a longer interdomain linker 
than ZFRB1 (Figure 2B) had a broader spectrum of 
activity (as was also seen in vivo); pRec(Z18), pRec(Z20), 
pRec(Z22) and pRec(Z24) were all recombined 
(Figure 5B), and in ethylene glycol buffer, ZFR(L6) 
promoted some DNA cleavage of all the substrates. 

As a control, all seven of the pRec(Z) substrates were 
treated with NM resolvase, which should not recognize 
the Zif268-binding motifs at each end of the Z-sites. 



pRec(Z24), pRec(Z26), and pRec(Z28), whose Z-sites 
include most or all of the 28 bp TnJ res site I, supported 
NM resolvase-mediated recombination (Figure 5C). 
pRec(Z24) is a better substrate than pRec(Z26), 
probably because some basepairs in the Zif268-binding 
motifs of Z24 accidentally match site I (Figure 2C). 
Similar results were obtained in in vivo assays (data 
not shown). More surprisingly, NM resolvase in ethylene 
glycol buffer promoted some double-strand cleavage 
and topoisomerization of all the pRecZ substrates, even 
those with short Z-site core sequences. Bands of the 
sizes predicted for cleavage at both Z-sites [seen, for 
example, in the pRec(Z16) products] show that at least 
some of the DNA cleavage is Z-site-specific. NM resolvase 
thus retains a residual capacity to recognize and cleave its 
natural target DNA sequence even when the specific 
motifs recognized by its C-terminal helix-turn-helix have 
been replaced by zinc-finger binding motifs. 

Binding and synapsis of Z-sites by ZFRs 

To ascertain the basis of the relationship between Z-site 
length and ZFR-mediated recombination efficiency, we 
need to know the stage in the reaction pathway that is 
affected. We therefore analysed binding and synapsis of 
Z-sites by ZFRs. 

Wild-type TnJ resolvase solution monomers bind co- 
operatively to a DNA fragment containing site I of 
res to form a dimer complex. Traces of a monomer- 
DNA complex can also be detected. The activated 
mutant NM resolvase can also form monomer and 
dimer complexes with site I, but two dimer-site I 
complexes can then interact to form a synapse containing 
a resolvase tetramer. These complexes can be observed 
by non-denaturing gel electrophoresis (14,20,28). 
Structures of resolvase-site I complexes show pronounced 
bending of the site I DNA (24,25,29,30). Our ZFRs have 
resolvase catalytic domains, so they were predicted to 
make analogous complexes with Z-sites. Radiolabeled 
50 bp substrates, each containing a centrally positioned 
Z-site, were prepared by annealing oligonucleotides 
(Figure 6A). When ZFRB1WT (wild-type TnJ resolvase 
catalytic domain) was added to a Z22 substrate, gel elec- 
trophoresis revealed lower mobility bands corresponding 
to binding of a ZFR monomer or a dimer (Figure 6B), as 
predicted. When ZFRB1 (activated NM resolvase catalyt- 
ic domain) was assayed similarly, an additional strong 
band with the low mobility expected for a synaptic 
complex was observed (Figure 6B). When a longer, un- 
labeled Z22 oligonucleotide was included in the reaction 
mixture, this band was shifted to an even lower mobility 
species, confirming that it contains two DNA molecules 
(data not shown). 

We then compared ZFR binding and synapsis of the 
Z-sites with different core sequence lengths. A single 
optimal concentration (~800 nM) of each ZFR was used 
(Figure 6C). ZFRB1WT bound to all the Z-sites to form 
monomer and dimer complexes, although the longest site 
(Z28) was not bound as efficiently as the others. The sites 
that recombine most efficiently (Z20, Z22) formed lower 
mobility dimer complexes than the other sites, consistent 
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Figure 5. ZFR-mediated recombination at Z-sites with different core sequence lengths. (A) Reactions catalysed by ZFRB1 on a set of pRec substrate 
plasmids, each containing two identical Z-sites as indicated above the lanes. For economy of space only one untreated sample ['blank'; pRec(Z16) 
substrate plasmid] is shown in each panel. The layout of the figure and annotation of bands are as in Figure 4. (B) Reactions catalysed by ZFR(L6) 
on the same set of plasmids. (C) Reactions catalysed by NM resolvase on the same set of plasmids. 



with increased bending of the Z-site DNA. When the 
active recombinase ZFRB1 was used, synaptic complexes 
were observed. The Z-sites that gave the most retarded 
dimer complexes with ZFRB1WT (Z20, Z22) also gave 
the strongest and lowest mobility synapse bands. In 
some lanes, there were multiple bands with the low 
mobility associated with a synaptic complex. We therefore 
propose that the synapses may exist in different conform- 
ational states. Additional faint higher mobility bands in 
the Z20, Z22 and Z24 lanes are interpreted as products of 
double-strand cleavage of the Z-site. When the catalytic- 
ally defective mutant ZFRB1SA was used instead of 



ZFRB1, the pattern of complexes was similar, but the 
presumed cleavage product bands were absent. 

Binding and synapsis by ZFR(L6) were also assayed, to 
determine whether the long linker peptide between the 
recombinase and zinc-finger domains in this protein 
(Figure 2B) affected the pattern of complex formation. 
ZFR(L6) converted less of the DNA to synaptic 
complexes than ZFRB1 (Figure 6C), but the range of 
Z-site lengths that supported significant ZFR(L6)- 
mediated synapsis was wider than for ZFRB1. These ob- 
servations parallel the recombination and cleavage assays 
(Figures 4B and 5B), supporting the hypothesis that the 
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Figure 6. Binding and synapsis of Z-sites by ZFRs. (A) The substrates 
used for the bandshift experiments. All the substrates are exactly 50 bp, 
with a centrally positioned Z-site. The diagram shows the Z16 site (boxed), 
with the motifs recognized by the Zif268 domains in bold. The longer 
Z-sites are similar except that the core sequence is longer (Figure 2C), 
and one or more of the immediately flanking basepairs at each end of 
the site (in italics) is deleted. The asterisk indicates the position of the 
5'- 32 P label. (B) Binding of the Z22 site by varying concentrations of 
ZFRB1WT (left panel) and ZFRB1 (right panel). Estimated ZFR concen- 
trations are indicated above each lane. The band marked 'cleavage' is a 
labelled half-site cleavage product with a ZFR monomer covalently at- 
tached via the active site serine residue; see text for details. (C) Binding 
of Z-sites (as indicated above the lanes) by ZFRB1 (top left), ZFRB1SA 
(top right), ZFRB1WT (bottom left) and ZFR(L6) (bottom right). 



level of ZFR catalytic activity reflects the extent of 
synapse formation. 



DISCUSSION 

Design of ZFRs 

It was shown previously that residues 1-144 of resolvase 
are sufficient for ZFR activity (10). However, ZFRB2 



which has a very short (two residues) linker between 
residue R144 and the zinc-finger domain is inactive. In 
contrast, residues 1-148 of resolvase can be connected to 
the zinc-finger domain by the same two-amino acid linker 
to give a very efficient recombinase (ZFRB1). We propose 
(see below) that the linker of ZFRB2 is not long enough to 
allow the two domains to interact properly with any 
Z-site. The extra four amino acids in ZFRB1 might 
contribute to its high recombination activity by extending 
the Tn3 resolvase domain to 148 residues, as well as by 
lengthening the protein. Analogous active ZFRs with a 
5-amino acid linker between Tni resolvase residues 1- 
144 and a zinc-finger domain have been described 
(12,17-19). 

An E. co//-based assay was used to select ZFRB1 
mutants with rate-enhanced activity (deletion of a 
marker gene). However, subsequent in vitro analysis of 
one of these mutants (ZFRB1VF) revealed undesirable 
side-reactions (see below). 

ZFR recombination and cleavage in vitro 

If ZFRs are to become practical tools for programmed 
DNA rearrangements at chosen genomic target sites, it 
is important that we fully understand their activities and 
mechanisms of action. With an in vitro system, we can 
study the basic properties of ZFRs in much greater 
detail than is possible in vivo. 

Tni resolvase-derived ZFRs can catalyse efficient, 
site-specific reciprocal recombination in vitro. The DNA 
ends are properly re-ligated; double-strand break inter- 
mediates do not accumulate to a high level except when 
induced by special reaction conditions. The activity of the 
most efficient ZFR tested here (ZFRB1) is strikingly 
similar to that of the activated Tni resolvase variant 
(NM) from which its catalytic domain is derived. As we 
had predicted, activating mutations in the ZFR catalytic 
domain are essential; ZFRB1WT is identical to ZFRB1 
except that it has the wild-type Tni resolvase catalytic 
domain, and it has no recombination activity. 

ZFR(L6), first characterized by Akopian et al. (10), 
which has a longer interdomain linker (Figure 2B), was 
less efficient in vitro than ZFRB1, but it was also less 
sensitive to the length of the Z-site core sequence (as dis- 
cussed in more detail below). We also tested a ZFRB1 
mutant, ZFRB1VF, which was selected for its enhanced 
rate of recombination in E. coli. The single additional 
mutation V107F in ZFRB1VF is at a position where 
activating mutations have been observed previously (22). 
ZFRB1VF gave low levels of recombinant products 
in vitro and caused accumulation of substrate plasmid 
DNA containing single- or double-strand breaks. This 
increased propensity to leave strand breaks at the Z-sites 
might be undesirable in applications of ZFRs, as it could 
lead to genetic damage. 

The ZFRs are highly specific for recombination at 
Z-sites; there was no evidence for off-site activity at 
'pseudosites', which would have produced extra bands 
on the gels shown in Figure 4. We estimate that we 
could have observed cleavage of <5% of the DNA at 
any pseudosite. ZFRs thus appear to have higher site 
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specificity than the parent NM resolvase, which does 
display some off-site activity (Figure 4; Ref. 20). All the 
active ZFRs promoted very efficient cleavage of a sub- 
strate with Z22 sites under reaction conditions where 
re-ligation of DNA ends is inhibited (Figure 4C). This 
suggests that the lower recombination efficiency of 
ZFRB1VF and ZFR(L6) is because of a defect in steps 
after cleavage. 

Most of our assays involved the use of negatively super- 
coiled DNA substrates. However, a linear DNA molecule 
was also recombined efficiently (Figure 4D), showing that 
ZFR recombination does not require supercoiling. 
Further experiments will be necessary to determine any 
more subtle effects of DNA topology, which might be 
important for specific applications of ZFRs. 

Dependence of ZFR activity on Z-site core sequence 
length 

Our results (Figures 3 and 5) dramatically confirm the 
surprisingly specific dependence of ZFR recombination 
and cleavage activity on the length of the Z-site core 
sequence, first noted by Akopian et al. (10). All the ZFR 
variants that we have tested recombine Z22 sites most ef- 
ficiently, in E. coli and in vitro. Sites with 21 and 23 bp core 
sequences (Z21, Z23) were recombined slightly less effi- 
ciently than Z22 sites in E. coli (data not shown). The 
preference for the Z22 site is due to its length, not any 
specific sequence motif, because the same length was 
preferred when a set of Z-sites with substantially different 
core sequences was tested [(10), and data not shown]. 

When we began this work, we had expected to find that 
ZFRs with short inter-domain linkers would prefer to act 
on Z-sites with short core sequences, and that as the core 
sequence length was increased, longer linkers would be 
preferred that can stretch between the zinc-finger 
domains at the ends of the Z-site and the catalytic 
domains at the centre (Figure IB). In vivo analysis failed 
to reveal any such correlation. However, in vitro, we do see 
significant effects of the linker length. The ZFR(L6) linker 
is 8 amino acids longer than that of ZFRB1 (Figure 2B). 
ZFRB1 recombination and cleavage activity was restricted 
to the Z20, Z22 and Z24 substrates (Figure 5A), whereas 
ZFR(L6) also recombined pRec(Z18), and gave cleavage 
products with all substrates except pRec(Z16) (Figure 5B). 
We suggest that the linker length is the main cause of this 
difference between ZFRB1 and ZFR(L6) activity (al- 
though we cannot exclude the possibility that the amino 
acid sequence of the linker is a significant factor). ZFRs 
with long linkers might therefore be useful to target sites 
where the motifs to be recognized by the zinc-finger 
domains are not spaced exactly 22 bp apart, but they 
might also have reduced specificity. 

Our results reveal that effects of core sequence length 
are apparent even at the early reaction steps of binding 
and synapsis (Figure 6). The Z-sites that are recombined 
most efficiently (Z20, Z22) also give the strongest, lowest 
mobility synapse bands and the lowest mobility dimer 
complex bands; the low electrophoretic mobility suggest- 
ing increased DNA bending. In contrast, the longest Z-site 
(Z28) was bound rather poorly by the ZFRs. 



To gain structural insight into these effects, complexes 
of ZFRs with all the Z-sites (Z16-Z28) were modelled 
using available crystal structures of a jS resolvase 
dimer-DNA complex, a synaptic yS resolvase-DNA inter- 
mediate and a Zif268 zinc-finger domain-DNA complex 
(see 'Materials and Methods' section for details). No steric 
clashes within the two ZFR subunits bound to any single 
Z-site were predicted. Likewise, no clashes between any of 
the four subunits in a synapse of two Z-sites were pre- 
dicted (Figure 7A; and data not shown). The straight-line 
distance D between the a-carbon atoms of amino acids 
that would be at either end of the ZFR linker (R144 or 
R148 of the resolvase domain and R3 of the Zif268 
domain) was measured from the models (Figure 7B). 
The feasibility of positioning the two domains of any par- 
ticular ZFR in the modelled arrangements can be 
estimated from D, assuming a maximum extended linker 
peptide length of ~3.6 A per amino acid. In ZFRB1 the 
extended length of the linker between the a-carbon atoms 
of resolvase R148 and Zif268 R3 (R-T-S-E-R) would be 
~14.4 A. The models predict that this linker would be 
long enough to reach between the two domains when 
ZFRB1 binds to Z20 or Z22 (the sites that recombine 
most efficiently), but not when it binds to the other 
Z-sites (Figure 7B). We therefore speculate that the 
ZFRB1 linker is too short for optimal productive 
binding to the Z-sites other than Z20 or Z22. Parts of 
the resolvase and/or zinc-finger domains might have to 
flex and/or alter their secondary structure from that seen 
in the crystals in order to allow the protein to span the 
shorter and longer Z-sites, and this might reduce profi- 
ciency for synapsis and recombination. On the longest 
sites, ZFR subunits bound at each end via their zinc 
finger domains might be too distant for a strong dimer 
interaction of the catalytic domains; hence binding is 
weaker. 

The structural models predict that the ZFRB2 linker 
between resolvase R144 and Zif268 R3 (also R-T-S-E-R; 
~14.4 A) is not long enough to allow proper interaction 
with any of the Z-sites (Figure 7B), in accord with our 
results showing that ZFRB2 is inactive in E. coli. The 
corresponding (R144-R3) linker length for ZFR(L6) is 
16 amino acids, which when fully extended would be 
~57 A long. This is more than D even for the longest 
site (Z28; D ~35 A). Actually, the linker would need to 
be much longer than 35 A for proper binding to Z28, 
because the N-terminus of the Zif268 domain and the 
C-terminus of the resolvase domain are on opposite 
sides of the double helix, so the linker would need to 
wrap around the DNA. Inspection of the Z28-ZFR 
models suggests that the ZFR(L6) linker should be long 
enough, yet binding, synapsis, cleavage and recombin- 
ation of the Z28 site are defective. At present, we can 
only speculate on the reasons for this defective interaction. 
The effective maximum length of the ZFR(L6) linker 
might be less than the fully extended length, due to sec- 
ondary structure; or, a negative entropy change upon 
stretching out the linker might disfavour binding to the 
long Z-sites. 

In summary, we can trace the dependence of recombin- 
ation efficiency on Z-site core sequence length back to 
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Figure 7. Modelling the interactions of ZFRs with Z-sites. (A) Molecular models of a ZFR bound to the Z22 site. The left panel shows a model of a 
ZFR dimer bound to Z22. The model is built from the resolvase-DNA structure 1GDT and two copies of the Zif268 domain-DNA structure 1AAY. 
The right panel shows a ZFR synaptic tetramer, built from four copies of 1AAY and the resolvase synaptic complex structure 1ZR4. The resolvase 
domains are blue, the Zif268 domains are yellow, and the DNA is grey. The positions of resolvase residues R144 and R148, and Zif268 domain 
residue R3, are indicated by the magenta, red and cyan blotches respectively. The black loops indicate the approximate locations of the ZFR linker 
peptide. (B) The distance D between the a-carbon atoms of resolvase residue R148 (or R144) and Zif268 domain residue R3 predicted from the 
molecular models, plotted against the Z-site core sequence length. Note that E2, the first residue of the Zif268 domain in the ZFRs (Figure 2B), 
could not be used for the measurements as it was not resolved in the 1AAY crystal structure. The plotted D values are averages of the values from 
the two crystallographically non-identical ZFRs in each of the lGDT-based models, or from the four non-identical ZFRs in each of the lZR4-based 
models. 



effects on binding of the ZFR to a single Z-site, prior to 
synapsis or any catalytic steps. However, we still cannot 
give a complete explanation of why a core sequence of 
exactly 22 bp between the zinc-finger binding motifs is 
preferred by all the ZFRs we have tested, regardless of 
ZFR linker length. The structural models predict that 
binding of very long Z-sites may be abnormal or ineffi- 
cient, thus reducing recombination efficiency. But why do 
the shorter Z-sites (Z16, Z18) bind ZFR efficiently, yet fail 
to synapse and recombine? One speculation proposes that 
synapsis and catalysis depend on proper interaction of the 
resolvase arm region with the minor groove of the DNA, 
as shown in Figure 2A. In the shorter Z-sites, the all-AT 
DNA sequence contacted by the arm region is adjacent to 
the all-GC motif recognized by the first finger of the 



Zif268 domain. It might be unfavourable to abut the 
DNA bound by the arm region, with its unusually 
narrow minor groove, to the DNA bound by the Zif268 
domain, which also has an unusual structure with a rather 
wide minor groove (26). An answer to this question will 
require further experiments. 

Use of ZFRs in biotechnology 

We aim to gain a detailed understanding of the properties 
and design of ZFRs in order to allow them to become 
useful tools for the promotion of specific genetic re- 
arrangements (3). In this section, we outline some of the 
main obstacles in the way towards widespread applicabil- 
ity of ZFRs, which might be overcome by approaches 
involving further in vitro analysis. 
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A typical application might be to promote integration 
of a transgene at a specific site in the genome of an 
organism. In most cases, it would be necessary to use 
two ZFRs with different DNA binding specificities to 
target a natural sequence, one to bind at each end of the 
'Z-site'. Another recombinase with a compatible catalytic 
domain, which recognizes a site on the transgenic DNA, 
might also be needed (Figure 8). To promote such a 
reaction effectively, several issues will need to be ad- 
dressed. First, the ZFR catalytic domains must promote 
recombination at the core sequence of the chosen target 
'Z-site', and this sequence might be very different from 
that of TnJ res site I. Towards this end, modifications of 
the catalytic domain that broaden or alter specificity for 
the Z-site core sequence have been reported recently 
(12,13,19), but it is not clear yet if these specificity 
changes are accompanied by undesirable effects on cataly- 
sis (for example, incomplete re-ligation of broken DNA 
strands, as we found with the ZFRB1VF variant). Second, 
strategies will be needed to favour ZFR-mediated integra- 
tion over the reverse reaction, deletion. ZFRs can promote 
integration in vivo (18,31) and equivalent intermolecular 
reactions in vitro (see for example Figure 4B). However, 
the Z-sites flanking the integrated DNA can recombine 
again, so recombination will tend towards an equilibrium 
of integration and deletion products. Deletion is thermo- 
dynamically favoured because the two separate deletion 
product DNA molecules have higher entropy than the 
single integration product molecule. No advances on this 
problem have been reported to date. A third imperative 
will be to minimize inappropriate subunit interactions 
when two or more different ZFRs are present, as in the 
scenario shown in Figure 8. If all the ZFR catalytic 
domains are identical, a single ZFR subunit bound cor- 
rectly at one end of a target site might dimerize with a 
subunit whose zinc-finger domain is not specific for the 
other end of the site. Although these undesirable inter- 
actions are reversible, they might nevertheless inhibit re- 
combination, reduce the effective ZFR concentration, and 
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Figure 8. ZFR-mediated site-specific transgene integration. Cartoon 
showing hypothetical ZFR-mediated integration of a transgenic con- 
struct into a genomic target site. The genomic site is recognized by a 
heterodimer of the magenta and orange ZFR subunits, whereas the site 
on the transgene DNA is recognized by a homodimer of the grey ZFR. 
The diagram on the right illustrates how, in this scenario, inappropriate 
on-site dimerization could lead to unproductive complexes. 



increase the likelihood of off-site reactions. Analogous 
problems arise in applications of zinc-finger nucleases 
(ZFNs), and protein modifications to block the formation 
of inappropriate ZFN dimers have been described (32-35); 
similar approaches should be applicable to ZFRs. Fourth, 
off-target activity must be limited to a very low level if 
ZFRs are to be used in demanding applications such as 
gene therapy, where unintended genetic damage could 
have very serious consequences. The natural biochemistry 
of resolvase and other possible progenitor serine recom- 
binases, requiring on-site dimerization and dimer-dimer 
synapsis prior to catalysis (20), suggests that ZFRs 
might have the potential for very high target specificity, 
perhaps more so than ZFNs (36). However, no critical 
experimental evidence on this issue is available as yet. 
The level of off-target activity will almost certainly be 
affected by aspects of ZFR design such as the origin of 
the catalytic domain, the activating mutations in the cata- 
lytic domain, and the characteristics of the inter-domain 
linker peptide, and these effects could be quantified by 
in vitro analysis. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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